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Summary:  The  fundamental  goal  of  this  research  project  is  to  comprehend  complex  collective 
behavior  in  human  society,  to  set  up  the  foundation  of  future  possible  inference  and  even  control 
of  social  collective  behavior.  Two  primary  technological  directions  have  been  fruitfully 
developed  to  accomplish  such  goal: 

•  Innovative  data  analytics  to  effectively  extract  critical  features  from  the  data,  to 
rapidly  recognize  the  abrupt  changes  of  data,  and  to  identify  the  network  relationship 
of  the  data.  In  addition  to  synthesized  data,  the  main  results  have  been  verified  from 
real  Internet  data.  A  new  social  network  model  associated  with  rate  rather  than 
traditional  graphical  properties  has  been  developed  to  make  inference  possible,  which 
has  been  verified  by  different  Internet  datasets. 

•  In-depth  comprehension  of  information  cascade,  a  collective  behavior  that  agents 
ignore  own  signals/observation  but  follow  others,  which  serves  as  a  foundation  of 
modern  social  warfare  over  Internet.  We  go  beyond  traditional  analytical  models 
of  information  cascade  and  develop  further  views  to  look  into  the  insights  of  such 
dynamic  system  to  enable  methods  to  “tilt”  collective  behavior  over  the  Internet 
or  over  human  society.  A  subsequently  new  security  mechanism  for 
low-complexity  sensor  networks  enables  trustworthy  operation  even  in  the 
presence  of  a  good  number  of  compromised  devices  based  on  social  learning  and 
information  cascade. 

Introduction: 

Since  the  Arabian  Spring,  linkage  between  online  opinion  dynamics  and  collective  behavior  in 
society  (particularly  political  related  movements)  has  been  received  a  lot  of  interest  and 
considered  as  a  potential  tool  for  social  warfare.  To  complete  understand  the  interplay  of  online 
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opinions  and  collective  human  behavior,  and  further  to  inter  (or  to  predict  with  statistical 
confidence)  potential  human/political/cyber  movement(s),  are  very  much  wanted  goals.  The 
following  figure  depicts  using  a  large  complex  dynamic  system  to  represent  the  complex  society. 
By  partial  observations  of  parts  of  the  complicated  society  (still  a  big  dataset),  the  model  of 
dynamic  networked  system  can  be  established  to  accomplish  our  ultimate  goal. 


The  goal  of  this  research  is  to  construct  new  fundamental  tools  and  model(s)  toward  the  ultimate 
goal.  Two  primary  technologies  have  been  identified  to  proceed  in  this  investigation: 

(a)  data  analytics  to  extra  networked  features  (i.e.  features  of  relationship),  then  develop  an 
effective  model  enabling  inference  that  was  not  possible  in  existing  literature 

(b)  in-depth  analytical  tools  to  comprehend  information  cascade  (i.e.  rational  agents  ignoring 
own  observations  to  follow  other  agents)  and  thus  collective  behavior  of  agents,  such  that 
enables  effective  influence  on  agents’  behavior. 

Above  original  and  fundamental  technologies  are  also  useful  to  develop  further  resilient  and/or 
proactive  defense  mechanisms  against  various  cyber-attacks  and  physical-attacks  in 
cybersecurity  and  human  society. 

Experiment:  Description  of  the  experiment s)/theory  and  equipment  or  analyses. 

Development  of  innovative  theoretical  model  and  methodologies  with  experimental  verifications 
by  realistic  Internet  data  has  been  executed  in  this  research. 

(a)  Data  analytics  toward  a  new  social  network  model: 

•  Communication  theoretic  data  analytics:  A  unique  differentiation  with  common  big 
data  analysis  for  this  research  lies  in  the  identification  of  hidden  information  that 
implies  critical  behavior/intention.  Such  “system”  identification  can  only  be  derived 
from  open  data  appearing  irrelevant  but  actually  containing  tiny  amount  of 
information.  The  methodology  based  on  communication  and  information  theory 
(thanks  to  leave  at  MIT  supported  by  this  research)  is  described  in  [Jl],  [C2],  and 
[C4],  in  which  how  to  select  useful  data  and  how  to  combine  useful  data  for  better 
information  extraction  are  uniquely  and  systematically  developed.  A  side  product  is 
[J5]  to  introduce  machine  learning  for  wireless  networks,  which  has  been  recognized 
as  IEEE  Popular  (top  100  papers  of  most  monthly  views  among  4M  papers  in  the 
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IEEE  Xplore)  since  publication. 

•  Similarly,  outlier  detection  (i.e.  abrupt  changes  in  behavior  or  events)  and  source 
separation  to  identify  potential  root-cause  in  data  analytics  have  been  studied  [C3] 
and  [C6]. 

•  New  rate-based  social  network  model:  Using  a  real  social  dataset  in  Taiwan’s  PTT,  it 
is  confirmed  that  large  social  events  and  online  opinion  dynamics  are  closely  coupled. 
Furthermore,  such  a  coupling  can  be  described  by  network  formation  and  statistical 
physics  (not  published).  Although  the  network  models  for  online  opinion  dynamics 
have  been  studied  and  developed  for  a  long  time,  existing  models  are  based  on 
network  topological  information  and  impossible  to  be  applied  for  effective  inference. 
After  all,  the  rate  information  from  data  is  used  to  establish  the  social  network  model, 
which  are  consistent  with  existing  topological  network  models.  Such  a  rate-based 
social  network  model  [C8]  is  possible  for  early  inference  prior  to  complete  network 
formation,  to  possibly  complete  the  ultimate  goal.  [J2],  [J3]  and  [C7]  are  side 
products  along  this  social  network  analysis  effort,  on  sensor  and  mobile  networks. 

(b)  Information  cascade  and  collective  behavior: 

•  Information  cascade:  Traditional  analytical  understanding  of  information  cascade  is 
totally  based  on  Bayesian  analysis,  which  is  insufficient  to  our  purpose.  We  examine 
information  cascade  as  a  dynamic  system  [Cl]  and  as  a  social  learning  mechanism  in 
details  [J4],  Furthermore,  by  incentive  seeding  and  rewiring  connections,  information 
cascade  can  be  successfully  tilted  under  various  types  of  random  networks  (since 
complete  observation  of  a  large-scale  social  network  is  impossible  for  an  outsider, 
he/she  can  only  treat  as  a  random  network  due  to  dynamic  social  network  topology) 
[C5].  A  thorough  view  on  information  cascade  is  documented  in  [J4],  an  unusually 
long  IEEE  journal  paper. 

•  Resilience  against  attacks:  Another  benefit  of  understanding  collective  behavior  is  to 
develop  resilient  mechanism  to  against  attacks.  [C9]  uniquely  shows  that  this  purpose 
can  be  achieved  by  leverage  social  learning  and  information  cascade,  and  illustrated 
by  infonnation  fusion  in  a  low-complexity  sensor  network,  even  a  good  number  of 
network  nodes  being  compromised. 

Results  and  Discussion: 

There  are  several  amazing  possibilities  to  develop  in  the  near  future  based  on  the  outcomes  of 

this  research. 

•  Social  warfare  has  been  identified  by  China  and  Russia.  By  more  verifications  on  large 
datasets  from  social  network  platforms,  our  research  results  are  ready  to  be  polished  and 
applied  for  real  implementation  of  (a)  early  warning/indicator  social/cyber  collective 
behavior,  and  consequently  tilt  collective  behavior  for  our  purpose  (b)  early  warning  of 
cyber-attacks  (i.e.  cyber  radar)  as  there  must  be  some  unusual  behavior  in  advance,  which 
can  be  possibly  obtained  by  social  network  analysis  on  rate-based  network  model  and 
communication  theoretic  data  analytics. 

•  Internet  of  battle  things  (IoBT)  emerges  as  an  important  technology.  Instead  of 
completely  secure  networking  and  computing,  as  almost  impossible  to  avoid  attacks 
(either  cyber  or  physical)  on  low-complexity  devices,  resilient  system  design  to 
accommodate  the  existence  of  compromised  nodes  is  the  way  to  avoid  tremendously 
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sluggish  secure  operations  to  prohibit  system  performance  of  original  design.  Again, 
compromised  behavior  has  unique  behavior  patterns  that  can  be  identified  through  social 
network  analysis. 

After  mid-May  2017  (project  ending),  research  efforts  on  this  project  have  been  continuing  to 
develop  a  method  as  an  early  indicator  to  rapidly  infer  collective  behavior  based  on  the  new 
social  network  model,  and  an  analytical  investigation  on  transient  formation  of  infonnation 
cascade. 
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Light  discussion  with  Dr.  Kevin  A.  Kwiat,  AFRL,  in  a  workshop.  Our  social  learning 
approach  on  information  cascade  is  highly  related  to  game  theoretical  study  in  AFRL. 
Under  the  support  of  Florida  Center  for  Cybersecurity  (2017-2018),  we  have  been 
investigating  resilient  dynamic  multi-agent  systems  against  cyber-  and  physical-attacks 
via  game  theory,  particularly  part  of  the  system  has  been  compromised. 
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